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Particle Swarm Optimization Based Source Seeking 

Rui Zou, Vijay Kalivarapu, Eliot Winer, James Oliver, Sourabh Bhattacharya 


Abstract —Signal source seeking using autonomous vehicles is 
a complex problem. The complexity increases manifold when 
signal intensities captured by physical sensors onboard are noisy 
and unreliable. Added to the fact that signal strength decays 
with distance, noisy environments make it extremely difficult to 
describe and model a decay function. This paper addresses our 
work with seeking maximum signal strength in a continuous 
electromagnetic signal source with mobile robots, using Particle 
Swarm Optimization (PSO). A one to one correspondence with 
swarm members in a PSO and physical mobile robots is estab¬ 
lished and the positions of the robots are iteratively updated 
as the PSO algorithm proceeds forward. Since physical robots 
are responsive to swarm position updates, modifications were 
required to implement the interaction between real robots and 
the PSO algorithm. The development of modifications necessary 
to implement PSO on mobile robots, and strategies to adapt to 
real life environments such as obstacles and collision objects are 
presented in this paper. Our findings are also validated using 
experimental testbeds. 

Note to Practitioners: Abstract —This paper was inspired by 
the source seeking problem when the signal source is very 
noisy and non-smooth. We focus our work particularly on 
electromagnetic source, but the strategies proposed in this paper 
can be applied to other types of sources. Most existing strategies 
approach this problem with gradient-based methods using one 
or more mobile agents. These methods either assume the signal 
profiles to be smooth or use complicated and computationally 
costly procedures to obtain accurate estimation of gradient 
information. This paper suggests approaching the problem using 
a heuristic method which is simple to implement on robots with 
limited computation capability. We propose some modifications 
to a population based optimization technique Particle Swarm 
Optimization (PSO), so as to adapt it a real-world scenario 
where a group of mobile agents trying to find an unknown 
electromagnetic source. The mobile agents know their own 
positions and can measure the signal strength at their current 
positions. They can share information and plan for the next 
step based on individual and group memory. We then propose a 
complete solution to ensure the effectiveness of PSO in complex 
environments where collisions may occur. We incorporate static 
and dynamic obstacle avoidance strategies in PSO to make it 
fully applicable to real-world scenario. In the end, we validate 
the proposed method in experiments. In our future work, we will 
work on improving the efficiency of the method. 

Index Terms —Particle Swarm Optimization, Source Seeking, 
Swarm Robots 


I. Introduction 

Seeking a source with autonomous vehicles is an area of 
growing interest and wide applications. The source could be 
an electromagnetic signal, acoustic signal, thermal signal, or 
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a chemical/biological agent. Motivated from source-seeking 
behavior exhibited by natural species from a microscopic level 
□ to a macroscopic level 0, researchers have developed 
robots m and sensor networks ED that can imitate these be¬ 
haviors in order to perform complex tasks such as environment 
monitoring, search and rescue operations, explosive detection, 
drug detection, sensing leakage of hazardous chemicals, pol¬ 
lution sensing and environmental studies. 

In this work, we address the problem in which a team 
of mobile agents, called the seekers, attempt to find the 
location of a source that emits an electromagnetic signal 
of unknown strength. The seekers can continuously sense 
the signal strength transmitted by the source at their current 
positions which generally decays with distance from the 
source. The decay profile of the signal strength is very noisy 
as shown in Figure [I] which makes many existing methods 
inapplicable. Based on this information, we investigate the 
issue of modifying Particle Swarm Optimization (PSO) and 
applying it to swarm mobile robots to approach the source 
seeking problem. 

A vast amount of research has been done on source seeking 
with autonomous agents based on the idea of gradient de¬ 
scent/ascent. Many methods employ a single agent to search 
for single or multiple, static targets. Authors of 0 propose 
methods of computing the gradients of the Cramer-Rao bound 
on the location error with respect to a sensor’s coordinates 
and moving the sensor opposite to the corresponding gradient 
directions. They extend their techniques in 0 to provide 
motion planning strategy for a mobile sensor used for detecting 
low concentration vapors. This method assumes very small 
sensing noise in the measurement model, and its applicability 
to a very noisy model is not known. The same issue is 
encountered by the algorithm implemented on Autonomous 
Underwater Vehicles to localize hydrothermal vents in [[7) . The 
method of estimating local gradient by taking measurements 
on a circle may no longer be effective on such a noisy model. 
In [81, the authors utilize the difference in RSSI received 
by a rotating directional antenna on the single wing of the 
Samarai MAV to obtain gradient. However, the application of 
this method is restricted by the dynamics of the Samarai MAV 
whose entire body including the mono-wing rotates at all time 
for stable hovering (9|. For other types of robots, additional 
structure has to be made, and extra energy has to be allocated 
to rotate the antenna at all times which is neither convenient 
nor cost-efficient. 

Recently, extremum seeking flol techniques have been 
adopted aiming at the source seeking problem. It has been ap¬ 
plied to nonholonomic vehicles in both 2-D ifTTll . [T21 and 3-D 
GH environment using sinusoidal and stochastic perturbation 
ED. The work in m also addresses the issue of stochastic 
noise in measurement. It uses the methodology of stochastic 
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Fig. 1: Map of RSSI (Received Signal Strength Indication). 
The color bars on the right indicate the signal strength in dBm. 
In this map, the source is located in the center where the RSSI 
is approximately -29 dBm 


approximation to deal with colored noise. Nevertheless, in 
all the extremum seeking related work, only a single vehicle 
is used to collect measurements at different locations which 
is time-consuming. In addition, the trajectories generated by 
extremum seeking always demand costly maneuvers. Authors 
of ED propose strategies to deploy a group of vehicles around 
the source while applying extremum seeking. However, their 
focus is on achieving a formation distribution of the vehicles 
in accordance with signal strength instead of taking advantage 
of the vehicle swarm in collecting measurements. Therefore, 
this work is not a favorable example of multi-agent source 
seeking. 

The advantages of robot swarm and sensor network attract 
many researchers to study them and apply them to the source 
seeking problem. In (TtI . lfl8l and (T9) . a team of agents 
implement a consensus algorithm to maintain a particular 
formation to track the gradient of the source. This method as¬ 
sumes the formation of the swarm being maintained perfectly 
which is too ideal to achieve. Cooperative source seeking 
algorithms are proposed in (20), (4), RTl and (22) . In (20lL 
the authors provide algorithms and experimental validation of 


a switching strategy for a team of agents trying to localize 
a source. Each agent switches from individual exploration to 
cooperative exploration only when individual gradient estimate 
is not available. A distributed coordination algorithm based 
on adaptive control is proposed in 1221. However, it does not 
take noise into consideration which is crucial to our problem. 
In a and ED, cooperative control is applied to deploy 
agents in a way which is optimal for gradient estimation. But 
these two methods together with all other methods in this 
paragraph are all prone to be trapped in a local minimum. 
Since all of them deploy the swarm or network in a close 
neighborhood, they lack the global information of the model. 
Authors of (23) and (24) apply stochastic approximation to the 
problem and enable the swarm to find the source in complex 
and noisy environment. But the computation complexity of 
the method hinders its implementation on some cheaper and 
less capable robots. There are other source seeking methods 
that are non-gradient based. Some are developed by obtaining 
source functions. In [25], (26], (27), (28) and [29], researchers 
formulate the source seeking problem as an inverse problem. 
Depending on the source types, heat and wave equations are 
commonly used as candidate source functions. Parameters 
of source functions are found by optimizing the difference 
between collected data and simulated data from the candidate 
function. The source can be located after obtaining the source 
function. For instance, in (26) . the incoming directions of 
waves are obtained after solving the inverse problem. By 
tracing back the wave direction rays, the source is located 
at the intersections of the rays. This method cannot be applied 
to our problem because it requires a priori knowledge about 
the candidate function that governs the decay profile of the 
source, and some information about the source like wave 
length and frequency. In our problem, only signal strength 
can be measured, and the signal decay profile is unknown. 
For the same reason, the statistical signal-processing technique 
‘independent component analysis’ (30l and statistical methods 
ED, Q, (32) used to construct a maximum likelihood map 
of the source location cannot be used for our problem. In 
(33), researchers use directional an tennas to obtain bearing 
measurements of tagged fish, and localize its location by 
triangulation. The method has been applied effectively in 
localizing the invaded carp in a lake. But it is not applicable 
to our problem since we can only access scalar measurements 
instead of bearing ones. In this paper, we address this problem 
using Particle Swarm Optimization (PSO) which does not 
require any a priori knowledge of the signal model emanating 
from the source. 

PSO is a heuristic non-gradient based strategy, an evolution¬ 
ary computation technique. It was first proposed by Kennedy 
and Eberhart ED who were inspired by the behavior of bird 
flock and fish school. Ever since then many variations of 
PSO have been proposed by researchers, like inertia weight 
PSO (35) . constriction PSO (36), neighborhood PSO (37) in 
its early time, and Quantum behaved PSO (38) and Digital 
Pheromone PSO [39] developed recently. 

As a swarm optimization technique, PSO has been applied 
to some source seeking tasks involving mobile robots. In (40) 
and ED, PSO is modified to adapt to multi-robot search. 

























































































































3 


The authors discuss the limitation posed by physical robots 
and conduct simulations with several communication models. 
However, these simulations are limited to some benchmark 
functions rather than real world signal sources. Since real 
sources such as electromagnetic signals, odor and heat sources 
are considerably different from the benchmark functions, the 
results in these papers are not remarkably useful to implement. 
In l42l and |43), the authors incorporate potential field-based 
motion planning method with PSO, and propose strategies of 
localizing static and moving odor source in complex envi¬ 
ronment. While their strategies have been proven effective 
in simulations for localizing odor sources, we would like 
to focus our attention to electromagnetic sources. We will 
also explore different variations of PSO and compare the 
performance of various parameter configurations, topology 
models and obstacle avoidance strategies. In essence, our work 
focuses on finding the most effective PSO variation to solve the 
electromagnetic source seeking problem, and validate it in real 
experiments. Authors of lf44l also use RF signals as sources to 
be sought. But this work does not consider a more complex en¬ 
vironment where obstacles exist, and is limited to simulations. 
A method for obtaining optimal PSO parameters is proposed in 
m. Essentially, it applies PSO at a lower level to seek for the 
source, and at a higher level to seek for the optimal parameter 
configuration. This iterative way of finding optimal parameters 
is reasonable computationally, but not applicable to real robots. 
It requires a significant number of trials at the lower level 
which leads to enormous amount of experimental data. The 
effort required to obtain the optimal parameter configuration in 
a specific scenario is extravagant for the simple goal of finding 
the source, especially when this optimal configuration can 
hardly be applied to a different scenario. f46l provides some 
experimental results of implementing a modified PSO on real 
robots. They use a diffuse light source as the target and provide 
a simple strategy of dealing with obstacles. The experiments 
illustrate the efficacy of implementing PSO on real robots, 
but are constrained to a specific PSO configuration. In this 
work, we present extensive simulation and experimental results 
of various PSO variations and configurations, and provide 
suggestions on parameter selection. 

This work is based on our previous exploration B71 . ED 
on applying PSO to the source seeking problem. The main 
contributions of this paper are as follows. 1) We use a non¬ 
gradient based technique for the source seeking problem due to 
the inherent irregularity in the signal model. 2) We incorporate 
physical constraints posed by robots in the implementation 
and evaluation of PSO. 3) Guidelines are presented to choose 
proper parameters for several PSO variations. A strategy which 
enables PSO to be implemented experimentally in a complex 
environment is first presented in this paper. 

This paper is outlined as follows. In Section |II| we provide 
some background information regarding the problem descrip¬ 
tion and the main concept of PSO. In Section [In| we evaluate 
and compare three PSO variations with different parameters. 
In Section [Iv| we propose collision avoidance strategies for 
implementing PSO in environments containing obstacles. In 
Section [VJ we present a description of the experimental setup, 
and discuss the implementation results. Finally, we conclude 


this work in Section [Yj 


II. Background 


A. Problem Description 

Consider a point source located on a plane continuously 
transmits/emits a signal. Based on the assumption that a static 
source is present in the vicinity, a group of mobile agents, 
called seekers , explore the environment to locate the source. 
The scenario is similar to a colony of ant swarms trying to 
locate a food source. The seekers are assumed to be holonomic 
kinematic agents with maximum speed u max . The seekers have 
the capability to measure the strength of the signal emitted 
by the source at their current locations. However, the seekers 
have no information about the current location of the source, 
its signal strength and its decay profile. The objective of the 
seekers is to find the location of the source which is assumed 
at the location where the signal strength is maximum. 

For most sources, signal intensity normally decays radially 
as the distance to the source increases. The decay profiles of 
some common sources are shown below: 

• Let P denote the power at which an electromagnetic 
source emits a signal. The decay profile of the signal 
intensity is given by the following equation [49] 


cP 

A ~ (IT W 


( 1 ) 


where Pa is the power of the signal measured at a point A 
on the plane located at a distance d from the source, c and 
a are constants that depend on the physical parameters 
of the medium through which the signal is transmitted. 

• The concentration of a chemical c(r, t) at a point r 
emitted from a point source located at p emitting vapors 
at a constant rate of p Kg/s is given by 0 


c(r, t) 


Aim | r — p\ 


erfc 


/ \r~p\ \ 

\2\//c(i - to)) 


( 2 ) 


where n is constant diffusivity in m 2 /s. If we ig¬ 
nore the complementary error function erfc(x) = 
(2/y/ir) T° e~ y dy and only consider the dominant part 
on its left in equation ([2]), the substance concentration is 
inversely proportional to the distance between r and the 
vapor source. 

• For a spherical sound source, the acoustic intensity I r at 
a point with a distance of r from the center in the radial 
direction is given by [50] 


r 47r r 2 

However, in reality the measured signal intensity is too 
noisy to be accurately described by these decay profiles. 
For instance, reflection, refraction, multi-path fading, etc. can 
influence the decay profile dramatically making the actual one 
highly different from the theoretical one. 

Figure [T] illustrates a real RSSI (Received Signal Strength 
Indication) map of an RF source provided by an XBee®ZB 
RF module on a 5 mx5 m plane. The XBee module was 
located at the center of this area, and the measurements were 
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taken by another XBee module. The figure clearly illustrates 
the fact that the real RSSI profile has many local extrema 
and is non-differentiable almost everywhere contrary to the 
theoretical decay profile described by Q shown in Figure [2] 



5000 

4500 

J0GC 

3500 

3000 

I 2500 
>- 

2000 

1500 

1000 

500 


1000 2000 3000 

X [mm) 


4000 


5000 




-25 

-30 

-35 

-40 

-45 

-50 

6000 


5000 


evolves iteratively in the search-space trying to improve the 
solution in the following manner: 


v 


k 

i +1 


X 


k 

*+1 


v k + U(0,c 1 )(Pbest k -x k ) 
+U{0,c 2 ){Gbest-x k i ) (3) 

x i + v i +1 

(4) 


where x k +1 and v k +1 represent the position and velocity of 
the fcth particle in the i + 1th iteration, U (0, c\) and U (0, C 2 ) 
are uniformly distributed random numbers within [0, c\\ and 
[0, C 2 ], and Pbest and Gbest are best previous position of 
a particle and best previous position in the swarm. A best 
previous position is where a particle obtains the minimum cost 
in its search history. In our case, the above equations can be 
interpreted in the following way: Assuming n seekers as n 
particles moving in the search-space X , the position of the 
kth seeker in the ith iteration is denoted as x\ G A C M 2 . 
The cost function / : M 2 -A R incurred by each seeker is the 
negative of the signal strength received at its current location. 
The objective of the seekers is to communicate, and move 
in a manner so as to reach the global minimum of the cost 
function. 

We initialize the position and velocity of each seeker with 
a uniformly distributed vector x\ in the search-space and 
v\ within given bound. Each seeker is assumed to have the 
knowledge of its own best previous position and the global best 
previous position based on the assumption that each seeker 
has the memory to store its own previous experience and can 
benefit from the previous experience of all other members. 
Therefore, in d3|, velocity v i +1 consists of three terms: the 
effect of seeker’s previous velocity, its best known position 
and global best known position. 


III. PSO Modifications and Variations 


Fig. 2: Theoretical decay profile of an electromagnetic source 

Therefore, an optimization method not limited by differen¬ 
tiability requirement, yet able to search highly multi-modal 
design spaces is desired for direct RSSI measurements, as 
portrayed in Figure 1. Also since mobile robots are used within 
the environment to seek maximum signal source, a population 
based method where each population member has a one to 
one correspondence with a mobile robot is favored. Particle 
Swarm Optimization (PSO), as described in Section II B, has 
the ingredients required to address the above challenges. 

B. Original PSO 

In this subsection, we provide a brief description of the 
concept of PSO. PSO is a population based search algorithm 
first proposed in l34l by Kennedy and Eberhart through 
simulation of a simplified social model. Although PSO was 
originally designed to solve minimization problems, it can 
be used to find the maximum of a function, with a simple 
change. It is initialized with a number of random solutions, 
called particles. Each particle is also randomly initialized with 
a velocity within some user designed range. Each particle 


In this section, we will introduce some physical constraints 
into PSO and compare the performance of three different PSO 
variations and provide guidelines on parameter selection. 

First, we shall consider the bounds to the search space. 
As an optimization technique, boundary conditions exist in 
PSO. However, different actions are taken when particles vi¬ 
olate boundary conditions: some discard these particles, some 
bounce them back, and some confine them to the boundary, 
etc. Discarding seekers may impair the performance especially 
when the total number of seekers is limited and every seeker 
is of significant value to performance. Therefore, we choose 
to confine seekers to boundaries, namely, 

Constraint 1: If x^ +1 £ X , then x^ +1 is set to the boundary 
point on X in the direction of v i+ V 

In PSO, there’s no constraint on the velocity of a particle. 
It is possible for a particle to fly across the entire search 
space in a single iteration. However, this does not apply to 
seekers in our case which are actually ground robots. It is more 
appropriate to treat the velocity of a particle as a step of a robot 
in our implementation which decomposes the step length into 
speed and duration. Given a sufficiently long duration, a robot 
also can move a large step which crosses the entire search 
space. However, it is at the expense of a longer searching 
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time, and a greater energy consumption, which is crucial to a 
robot with limited battery capacity. On the contrary, the step 
length should not be too small to affect the performance of 
PSO. To balance performance and efficiency, simulations to 
find a proper step length are conducted below. We denote the 
step length by v max , and check the following constraint in 
every iteration. 

Constraint 2: If \ v i+i\ > ^max? then \ V i+l I = "max with the 
direction of v i +1 unchanged. 

The aforementioned constraints apply to all simulations and 
experiments in this paper. 


A. PSO with Inertia Weight 

One variation of the original PSO is to introduce an inertia 
weight cc to the previous velocity in ([3]), which leads to the 
following equation to update velocity [35], 

v k i+1 = UiV k + U(0, d)(Pbest k -x k ) +17(0, c 2 )(Gbest - x k ) 

(5) 

According to Shi and Eberhart’s analysis in (35), the inertia 
weight is critical in balancing global and local search. If 
uj is set to zero, the seekers become “memoryless” about 
its past velocity. With seekers’ velocity only determined by 
individual and global best previous positions, all seekers would 
converge to the global best position directly making the search 
process resemble a local search. On the contrary, if uj is 
set to a larger number, the seekers are more stubborn in 
their previous velocity, which leads them to larger area of 
exploration. In other words, a larger inertia weight facilitates 
global exploration while a smaller one facilitates local ex¬ 
ploitation to fine-tune the current search area (511 Therefore, 
implementing a damping mechanism to uj contributes to better 
global exploration in the beginning stage and better local 
exploitation when the swarm is closer to the source. 

The study in l52l shows that c\ and C 2 together contribute to 
the oscillation behavior for the seekers. As the values of c\ and 
C 2 are increased, the frequency of oscillation of the seekers’ 
trajectories also increases. Hereafter, we set c\ = C 2 = 2 as 
suggested in l52l . We start with multiplying uo with a damping 
coefficient A^ as the damping mechanism, and set A w = 0.95 
as suggested in ED.© is implemented in every iteration after 
velocity is updates. Therefore, 


uji+i = A^cci, with A^ = 0.95 


( 6 ) 


We choose the swarm size to be five in Section IIII-AI 


and III-B and this will be explained in Section III-C Six 
sets of simulations with different initial uo and v max were 
conducted. In each set, we ran 1000 simulations on the real 
RSSI design space described by Figure [I] The cost function 
is defined as the negative RSSI at each point which needs to 
be minimized. Each simulation was terminated when Gbest 
remained unchanged for 20 iterations. Since the signal strength 
at the source is -28 dBm, we compared Gbest with 28 
after each simulation. In addition, we counted the number 
of iterations I and the total distance traveled by all robots 
TotalD. Additionally, the following data is also collected: 

• avgGbest denotes the mean of Gbest. 


• stdGbest denotes the standard deviation of Gbest. 

• avgl denotes the mean of /. 

• avgTotalD denotes the mean of TotalD. 

Simulation results are shown in Table [IJ where the units of 
Vmax, and avgTotalD are mm/iteration and mm, respectively. 
The first four sets illustrate the effect of increasing cci on 


TABLE I: Simulation results with different c o and u max , and 
a damping coefficient A w =0.95 


Set 

UJl 

^max 

avgGbest 

stdGbest 

avgl 

avgTotalD 

1 

2 

500 

28.1095 

0.8095 

45.639 

52017 

2 

3 

500 

28.0584 

0.5715 

46.613 

66716 

3 

4 

500 

28.0358 

0.3631 

47.187 

77312 

4 

5 

500 

28.0215 

0.1187 

47.855 

84749 

5 

2 

1000 

28.0378 

0.3553 

48.313 

101966 

6 

3 

1000 

28.0272 

0.1507 

50.747 

133442 


the performance of the searching algorithm. As cci increases 
from 2 to 5, avgGbest gets closer to 28, which means the 
seekers perform better in locating the source. Meanwhile, 
a decreasing stdGbest represents growing reliability of the 
algorithm which is another indicator of improved performance. 
This improvement can be supported by the fact that larger c o 
facilitated global exploration. With a larger initial cc, seekers 
tend to preserve their previous velocity, and explore a larger 
area in early iterations. Therefore, they are less likely to be 
trapped in a local minimum, and more likely to find the 
global minimum. However, the improved performance is at the 
expense of higher energy consumption. Though the average 
iterations avgl is not clearly related to the change of cci, 
avgTotalD in set 4 is about 2.5 times that of set 1. Figures 
[3] and [4] are good demonstrations of the reason, in which 
seekers are represented by different colors and the small circles 
represent initial positions. It is clear from the figures that 
trajectories with cci = 4 cycle in a larger area and converge 
slower than that of cci = 1. So we can clearly see a trade-off 
between performance and energy consumption. 

Sets 5 and 6 are used to reveal the influence of v max . Com¬ 
paring sets 5 and 6 with 1 and 2, we find slight improvement 
in the avgGbest and stdGbest when ?; max doubles. However, 
the average total distance traveled also doubles. Moreover, 
if we take set 4 into consideration, apparently, increasing uj 
is a better strategy than increasing u max in terms of both 
performance and energy efficiency. 


B. PSO with Constriction Factor 

Another variation similar to PSO with inertia weight that 
is implemented to the source seeking problem in this paper is 
PSO with a constriction factor. Introduced by Clerc in [36], the 
constriction factor is used to prevent “explosion” and ensure 
convergence of PSO. Equation 0 and ^ describe the basic 
concept of the constriction factor. 


uf +1 = K[v!? + U(0, ci)(Pbest k — Xi) + U(0,c 2 ) (Gbest — x\)\ 

(7) 


K = 


2 

2-0- vV 2 - 40 


where 0 = c\ +C 2 , 0 > 4 (8) 
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Trajectories of Seekers Trajectories of Seekers 




(a) Trajectories of seekers with u — 1 (a) Trajectories of seekers with uj — 4 




(b) Statistics with uj — 1 


(b) Statistics with uj — 1 


Fig. 3: Trajectories of seekers with uj = 1 


Fig. 4: Trajectories of seekers with uj = 4 


Compared to the original PSO, the entire RHS of ([3]) is 
multiplied by a coefficient AT, called the constriction factor. 
K is a function of ci and c 2 as shown in ([8]). The main idea 
of constriction PSO is to take advantage of the mathematical 
nature of ([3} which guarantees the convergence of the algo¬ 
rithm. Detailed explanation of the mechanism of constriction 
PSO can be found in l36l which is beyond the scope of this 
paper. 

A closer look at 0 reveals that it is a special case of 
0, whose inertia weight c j is set to K , and ci and C 2 are 
multiplied by K. It is the relation between 0 and K that 
prevents the swarm from “explosion”. Therefore, according 
to Clerc, t; max is not necessary when the constriction factor is 
applied. However, for application reasons that were mentioned 
before, we would keep t; max to a smaller value to improve 


the energy efficiency of the robots. Since K is a decreasing 
function of 0 whose supremum is 1. If we think of it in terms 
of ([5]), the supremum of cc is 1. This suggests the constriction 
PSO does not emphasize global exploration at the initial stage 
of the search. And it does not favor local exploitation, either, 
since K does not vary through the search. 

We also conducted 6 sets of simulations on the constriction 
PSO algorithm. In all sets, c\ and C 2 are set to the same 
value of 0/2 to balance the influence of individual and swarm 
experience. All configurations are identical to those in the 
previous section unless otherwise specified. Data is collected 
in Table [II| Sets 7, 8, 9 and 10 show the impact of decreasing 
0, or increasing K. As K increases, growing emphasis is put 
to the term of previous velocity in 0. Therefore, the seekers 
tend to explore larger area and have a higher chance of finding 
















TABLE II: Simulation results of constriction PSO 


Trajectories of Seekers 

K=0 90488, o =4.01, Cl =2.005, c 2 =2.005 
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Set 

K 

4 > 

^max 

avgGbest 

stdGbest 

avgl 

avgTotalD 

7 

0.5 

4.5 

500 

29.2826 

2.4494 

51.443 

15837 

8 

0.73 

4.1 

500 

28.6432 

1.9508 

54.338 

32282 

9 

0.8 

4.05 

500 

28.5366 

1.6601 

49.583 

44542 

10 

0.90 

4.01 

500 

28.3358 

1.4592 

43.043 

62037 

11 

0.73 

4.1 

1000 

28.2765 

1.1675 

48.535 

52972 

12 

0.8 

4.05 

1000 

28.1730 

0.9260 

45.996 

77154 


the source. This improvement in performance is evident in 
these 4 sets, as both avgGbest and stdGbest decrease with 
K. And we can see avgTotalD also grows with K regardless 
of how avgl varies. This is also the result of favoring global 
exploration, since the seekers “fly” longer distance in each 
iteration when they emphasize exploration. 

In set 11 and 12, we keep K equal to that in set 12 and 13, 
and only double ?; max . We can see significant improvement 
in performance when v max doubles. Because this allows the 
entire RHS of 0 to be doubled, including the term for 
previous velocity which puts emphasis on global exploration 
in another way. However, this improvement is not seen in set 5 
and 6 in PSO with inertia weight. A reasonable guess may be 
that when the coefficient uo is large, the performance is mainly 
influenced by uj rather than u max . 

Observation on the trajectories of seekers reveals another 
feature of constriction PSO which can be seen in Figure [5] This 
figure illustrates one typical simulation result where K = 0.9. 
In this simulation, the source is found after about 40 iterations, 
however, the swarm does not converge to the source after that 
as in Figures [3] and [4] Instead, all seekers keep oscillating 
around the source showing no sign of convergence. Comparing 
this to those sets on PSO with inertia weight, we can find that 
the violent oscillation in Figure [5] actually roots in the lack of 
a damping mechanism in constriction PSO. With a constant 
coefficient for the previous velocity, the swarm is incapable of 
switching from favoring global exploration in initial stage of 
the search to favoring local exploitation in later stage. 

Based on the collected the data, our preliminary judgment 
is PSO with inertia weight is better suited for our application. 
Moreover, 1/10th of the length of the search space is a 
reasonable value for ?; max . As for the inertia weight, any value 
between 2 and 4 should produce some good results. 

C. SPSO 

The last PSO variation studied in this paper is Standard Par¬ 
ticle Swarm Optimization (SPSO). It is a substantial improve¬ 
ment to the original PSO published in 1995, and researchers 
that developed their own PSO implementations benchmark 
their methods performance against SPSO. The implementation 
of SPSO 2006 can be found here (53). In this section, we will 
first provide a brief description of SPSO 2006, then study three 
SPSO topology models. 

The velocity update equation in SPSO is almost the same 
as Equation except that Gbest is replaced with Lbest - 
best previous position in the neighborhood, as shown in the 
following equation. 

v k i+l = uv k + f/(o, c)(Pbest k - x’l) + {7(0, c)(Lbest k - x k ) 

( 9 ) 
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Fig. 5: Trajectories of seekers with a constriction factor of 
K = 0.9 


As a benchmark variation, there are generally accepted values 
for all the parameters in SPSO. The swarm size is determined 
by 10+ [2 a/D], where D is the dimension of the search space. 
So we use 12 seekers in this subsection. Other parameter 
values are 


C'jJ 

c 


2 ln(2) 


: 0.721 


- +ln(2) « 1.193 


Please refer to ED for detailed description on initialization 
and confinement of SPSO. 

A noticeable distinction in SPSO is the introduction 
of neighborhood. Neighborhood defines the communication 
topology among seekers. In this subsection, we will study and 





















































compare the implementation of three commonly used models 
on the source seeking problem. 



TABLE III: Simulation results with different topology models 


Set 

Topology 

avgGbest 

stdGbest 

avgl 

avgTotalD 

13 

ring 

28.000 

1.82E-05 

29.970 

68380 

14 

fully connected 

28.000 

6.74E-04 

29.331 

65475 

15 

K = 3 

28.000 

2.20E-04 

29.259 

68860 

16 

K = 6 

28.001 

3.16E-02 

28.671 

66913 

17 

K = 12 

28.002 

4.47E-02 

29.127 

65212 


One plausible reason for this inconclusive result may lie in the 
number of seekers. 12 seekers seems to be excessive for our 
implementation making the influence of topology model and 
other parameters negligible. For the same reason, we only used 
five seekers in previous subsections to distinguish influence of 
those parameters of interest. In future implementations, we 
would prefer the fully connected model for simplicity reason. 




(c) Adaptive random topology 
Fig. 6: Graphs of different topologies 


Figure |6] present the graphs of all three models. Figures 
[6ja) and[6p3) are self-explanatory. Figure [6jc) is the adaptive 
random topology model |[55l when K = 3. In this model, each 
particle informs K random particles and itself of its Pbest , 
which means it informs at most K + 1 different particles 
and at least one particle (itself). For instance, in Figure [6c| 
particle 6 informs particle 2 and itself and has 5 informants 
{1,2, 3,4, 5}. Lbest of a particle is defined as the best Pbest 
among all its informants. This graph changes after every 
unsuccessful iteration (no improvement in Gbest). 

To compare the aforementioned topology models, we con¬ 
ducted five sets of simulations with different models. Table Hill 
collect all simulation data. Surprisingly, there’s no distinguish¬ 
able difference among these various models either in terms of 
Gbest or avgTotalD. Consequently, we cannot draw any solid 
conclusion on the superiority of one model over the others. 


IV. PSO in Complex Environment 

In previous implementations, the source seeking task is 
carried out in an ideal obstacle-free environment. However, in 
real-world, we have to cope with obstacles as well as collisions 
among seekers which can be modeled as collision avoidance in 
the presence of dynamic obstacles. Therefore, we decompose 
the obstacle avoidance problem into two stages to deal with 
static and dynamic obstacles, respectively. 


A. Static Obstacles 

Static obstacles are common in a search environment. Con¬ 
structions and uneven terrain are all potential static obstacles 
for seekers. We will give a short description of two static 
obstacle avoidance strategies proposed in our previous work 
[471, [48]. Then we will integrate them into SPSO and 
compare their performance in simulations. 

Obstacles are described as simple convex or concave poly¬ 
gons in the search space as shown in Figure [7] The red 
star in the center represents the source. Seekers are provided 
with the information about each obstacle’s position and size 
beforehand. The main idea of integrating obstacle avoidance 
into SPSO is to add a new operation mode to the seekers. They 
operate in the regular mode implementing SPSO when their 
trajectories do not collide with obstacles, and switch to obsta¬ 
cle avoiding mode when there’s potential collision. Strategy 1 
inherits the heuristic nature of PSO. It introduces a step with 
a specific length and a random direction into PSO when an 
obstacle lies in the next step of a seeker. We set the length of 
this random step to be the “diameter” of the obstacle so that 
the seeker has a good chance of circumvent the obstacle in one 
step as shown in Figure [8a| Here diameter refers the largest 
distance between any two points on the obstacle. Let Dj 
denote the diameter of the jth obstacle. Algorithm [l] presents 
the procedure of this strategy. It is executed whenever a new 
step is generated for a seeker by PSO. In other words, collision 
with any obstacle is always checked for every step from x\ 

Figure 


to x^ +1 before it is executed 


8b 


demonstrates the 


trajectories of 12 seekers implementing Strategy 1 in SPSO. 
Different seekers’ trajectories are represented by different line 
styles. denotes the initial position of each seeker, and red 
“x” represents a potential collision with an obstacle. 
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Fig. 7: Map with obstacles 


Algorithm 1 Static Obstacle Avoidance Strategy 1 

l: if is in the j the obstacle then 
2: repeat 

3: set Vi +1 to a random direction and let \v!? +1 \ = Dj 

4: 4+1 = 4 + 4+1 

5: until x^ +1 is not in any obstacle 

6: end if 

7: Proceed with the normal PSO 


Strategy 2 is a variation of the Bug 1 algorithm [56]. 
Instead of knowing the position of the goal, only the signal 
strength at its current position is known to a seeker in our 
case. Once a seeker switches to obstacle avoidance mode, 
it starts to circumnavigate the encountered obstacle. As it 
circumnavigates, it measures the signal strength along its 
path. After circumnavigating the entire obstacle, the seeker 
follows the shortest path on the boundary to point at which 
it measures the largest signal strength and implements regular 
SPSO. Although in our case, it is not guaranteed that the seeker 
would end at the closest point to the source on the obstacle’s 
boundary as in the Bug Problem , it is highly likely to be on the 
side of the obstacle which is closer to the source. Because the 
source signal strength generally decays with distance, though 
it is quite noisy and does not strictly follow a decay profile. 
This provides the basis of implementing the Bug 1 algorithm 
and prevents the seeker from going back to the same obstacle. 
Figure [9a] illustrates the trajectory of a seeker implementing 
the “Bug 1” algorithm to avoid obstacle. And Figure [9b] 
presents the trajectories of 12 seekers implementing Strategy 
2 in SPSO. 

Now we provide more simulation results to compare these 
two obstacle avoidance strategies. We conducted 4 sets of 
simulations. Set 18 and 19 used the parameters in set 2, and 
set 20 and 21 used the fully connected topology. Simulation 
results are collected in Table HVl 

Strategy 1 outperforms Strategy 2 in both avgGbest and 


Obstacle 



* 

\ 

\ 




(a) Strategy 1 

Trajectories of Seekers 



(b) Trajectories of seekers 
Fig. 8: Static obstacle avoidance - Strategy 1 


TABLE IV: Simulation results for two obstacle avoidance 
strategies 


Set 

Variation 

Strategy 

avgGbest 

stdGbest 

avgl 

avgTotalD 

18 

Inertia 

1 

28.0229 

0.3779 

45.3947 

68475 

19 

Inertia 

2 

28.2634 

1.2469 

66.3155 

66400 

20 

SPSO 

1 

28 

2.65E-04 

33.776 

74017 

21 

SPSO 

2 

28.0826 

0.4034 

35.867 

50034 


stdGbest for both PSO variations. Very small standard devi¬ 
ation suggests the high reliability of Strategy 1. The reason 
Strategy 1 ends with longer distance is that its random step 
is usually larger than v max because of the size of obstacles. 
While in Strategy 2, seekers usually take steps shorter than 
i; m ax when circumnavigating obstacles. Overall, Strategy 1 is 
better than Strategy 2 in simulations. Moreover, performance 
distinction originated from different variations is more signif¬ 
icant than from different strategies. This primarily result from 
the size of the swarm. 
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(a) Strategy 2 

Trajectories of Seekers 



(b) Trajectories of seekers 

Fig. 9: Static obstacle avoidance - Strategy 1 (Bug 1 Algo¬ 
rithm) 


B. Dynamic Obstacles 

In all previous simulations, seekers are assumed to be points 
on a plane. However, in practice they have a finite area. This 
makes dynamic obstacles avoidance an inevitable issue in 
the application of swarm robots since every robot acts as a 
dynamic obstacle to others. To deal with this problem, we 
add two steps to the obstacle avoidance mode. 

During each iteration, after x^ +1 are generated by PSO and 
checked or modified using the static obstacle avoidance strat¬ 
egy, potential collisions among seekers need to be checked. 
In this stage, there are two possible kinds of collisions: (1) 
Collisions at seekers end points; (2) Collisions in seekers 
trajectories. Since the seekers are assumed to dimensionless 
point particles in PSO, the algorithm needs to be modified 
to take into account possible collision between the robots 
at the end of their paths in a real scenario. Some seekers 


maybe too close to fit in the real robots causing collisions 
at these end points. In order to circumvent this problem, we 
incorporate a model that forces the seekers to repel each 
other to rearrange their end points to avoid collision. This 
is described in Algorithm [2] 


Algorithm 2 End point arrangement using repulsive force 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 
17 


S is the set of seekers 
R is the radius of a seeker 
t is a scaling factor 

while 3 \x p i+1 - x q i+1 \ < 2 R, p, q e S,p ^ q do 

for each k e S do 

for each j G S.j ^ k do 

rf - ry* _ ry>^ 

a ~ x i +1 1 

if \d\ >= 2 R then 
Force(k,j ) = 0 

else 

Force{k, j) = d(2R — |d|)/|d| 

end if 
end for 

Force(k) = T,jes,j^k Forc e(k,j) 
x^ +1 = x\ +l + tForce(k) 

end for 
end while 


Algorithm [2] ensures safe distance between any two seekers, 
and avoids end point collision. After this, if any seeker happens 
to lie in the path of others, the second step is activated. In 
this mode, seekers move sequentially. Only one seeker moves 
at a time while others stay still. We treat all other seeker 
as rectangular obstacles. We construct a reduced visibility 
graph E3 from the current position x\ of the activated seeker 
to its next position x^ +1 . Finally, by applying the Dijkstra’s 
algorithm [58j], we generate the shortest path from x\ to x^ +1 . 

Figure [TO] presents an example of the visibility graph and the 
shortest path. Due to the finite non-zero area of a seeker, the 
boundaries of obstacles and stationary seekers are expanded to 
the black dashed line to ensure a safety zone for the activated 
seeker (Minkowski sum of the obstacles with the seekers). The 
solid black lined delineate the visibility graph. The red dashed 
line represent the shortest path between x\ and 

So far, we have proposed a complete solution to implement¬ 
ing PSO on real robots in a complex environment where there 
exist potential collisions. In the next section, we will describe 
the experimental setup for implementation. 

V. Experiments 

Our testbed is built on a 5 mx5 m area covered by the 
Vicon tracking system. This system provides accurate position 
information of robots by recognizing markers on the robots 
serving as an indoor GPS system. The source is an XBee 
module hanging in the middle at a height of 20 cm above 
floor. We do not place it on the floor in order to avoid 
potential collision with the robot. Robots used in experiments 
are small differential-drive robots modified from the Parallax 
Shield-Bot controlled by Arduino. Each robot is equipped 
with an XBee module to measure RSSI. Figures m and [12] 
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Fig. 10: Visibility graph and shortest path 


are pictures of the testbed and robots. Figure [13] illustrates 
the complex environment with obstacles in which experiments 
were conducted. 



Fig. 11: Testbed 



Fig. 12: Experiment environment with obstacles 

In the experiments, we built a centralized system with a 
computer being the center collecting and distributing infor¬ 
mation from and to all robots. This is not necessary since 
the strategies proposed in this paper are not computationally 
expensive and can be implemented on these robots without 
a strong computation ability. Also, since each robot can also 
communicate with each other, this system can work effectively 
without a central unit if robots have access to their own 
positions. 



Fig. 13: Close look of robots and source 

Two successful experiments were recorded in the video. In 
these experiments, five robots were deployed to seek the source 
implementing the proposed strategies in an environment with 
obstacles. The parameters were chosen to be the same as set 
2 in Table H 

VI. Conclusion 

In this paper, we explored the implementation of PSO to the 
electromagnetic source seeking problem. We modified PSO 
in accordance with the physical constraints posed by robots 
and the environment. Three PSO variations were evaluated 
through simulations. We found that the inertia weight PSO is 
best suited to our implementation and provided guidelines on 
parameter selection in PSO. We extended PSO from a pure 
computation technique to a complete solution to the source 
seeking problem in complex environment. Collision avoidance 
techniques were discussed extensively in this paper, and a 
complete obstacle avoidance strategy was incorporated in PSO. 
Our work was validated eventually in experiments using real 
robots. 

In the future, we plan to explore and develop more advanced 
PSO variations that are specific for robotics applications. 
We would like to extend our work to more general source 
seeking scenarios, where sources may have different features 
and the obstacles in the environment cannot be simplified as 
polygons. Though it is unlikely that any variation can perform 
effectively in all kinds of scenarios, it is possible to explore 
the preferences of various scenarios and provide guidance in 
the selection of variations and parameter configurations. 
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